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Evolutionary models for populations of constant size are frequently studied using the Moran 
model, the Wright-Fisher model, or their diffusion limits. When evolution is neutral, a random 
genealogy given through Kingman's coalescent is used in order to understand basic properties 
of such models. Here, we address the use of a genealogical perspective for models with weak 
frequency-dependent selection, i.e. Ns =: a is small, where s is the fitness advantage of 
a fit individual and N is the population size. When co mputing fixation probabilities, this 
leads either to the approach proposed by iRoussej (|200Sf ). who argues how to use Kingman's 
co alescent in order to study we ak s election, or t o ext ensions of the ancestral selection graph 
of lNeuhauser and Krone! (|l997l ) and iNeuhauserl |l999j). As an applica tion of this genealog ical 
approach, we re-derive the one-third rule of evolutionary game theory (|Nowak et alj , l2004 ). In 
addition, we provide the approximate distribution of the genealogical distance of two randomly 
sampled individuals under linear frequency-dependence. 



1 Introduction 



Evolutionary game theory has long been stud ied using replicato r equa t ions and deterministic 
system s by u sing an infinite population limit (jTavlor and Jonker . Il978t iMavnard-SrnTtbl . Il982t 
Nowak . 120061) . Recently, the amount of research dealing with repeated games in finite populations 
increases. In the simplest case, a population of N players (each carrying one out of a possible 
set of strategies) repeatedly chooses an opponent at random and plays a game with payoff matrix 
M. In addition, players can produce offspring and the fitness of an individual with strategy A is 
determined by the average payoff the player obtains in the evolutionary game. Since the average 
payoff depends on the frequencies of the strategies in the population, the fitness is frequency- 
dependent, and the frequency path is e.g. given by a Moran model or a Wright-Fisher model with 
frequency-dependent selection. 

When studying models for populations of individuals carrying different type s (or adopting 
different strategies), several questions are most important (see e.g. Ewensl 2004 ). What is the 
probability of fixation of a type (in absence of mutation), or what is the random distribution of 
types in mutation-selection balance? If fixation occurs, what is the distribution of the fixation 
time? When taking a sample from the population, what is its genealogical structure? In addition, 
extensions to structured populations or to the multi-type case are considered. For the questions 
on fixation probabilities and times, the theory of bir th-death processes and (one-dimensional) 



diffusions is frequently used ([Karlin and Tavlorl 119811) . Here, explicit formulas are available for 
such quantities, but these can hardly be extended to multi-dimensional cases. Another approach 
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is the use of the genealogical structure by coalescent processes (jKingmanl . Il982t iBerestvcki , l2009h . 
The advantage is that it is extensible to populations in structured or multi-type populations. In 
addition, qu estions on the genealog i cal str ucture can be asked. As an example from evolutionary 
game theory. iGokhale and Traulsen ( 2011 ) study the frequency path of many- player many-strategy 
games using such a genealogical approach. 



The current paper was motivated by the one-third rule of evolutionary game theory (jNowak et al 

20041 ). It states that in finite populations of players with two strategies A and B, strategy A is 



favored in the sense that the fixation probability of type A in a B-population is higher than under 
neutrality if and only if type A is favored in the infinite population limit at frequency 1/3. Since 
the one-third rule was found in a Moran model, it has as w ell been proven to hold in a Wright- 
Fisher model (Traulsen et al. , 20061 : Imhof and Nowak , 2006 ) and i n general exchangeab le models 
(ILessard and Ladretl . 120071 ) . An intuitive explanation was given in lOhtsuki et al.l (|2007l ) , who ar- 
gue that an individual inte racts on average with B-players twice as often as with A-players, which 
implies the one-third rule. IWu et al. I (I2OIOI) discuss violations of the one-third rule for non-linear 
frequency dependence. 



The goal of the present paper is to use a genealogical approach in the stu dy of fix a tion p rob- 
abilitics. Only iLadret and Lessard| (2007), wh o use a method developed by Rousset ( 20031 ). al- 



ready take this route; see also 



Roussetl (|2004yp , p. 92f. We complement their approach by us- 
first introduced in th e frequency- independent case 



in g the Ancestral Selection Gr aph (ASG) , 

by iNeuhauser and Krone ( 1997 ) and Krone and Neuhauser ( 1997 ). This graph extends the coa 
les cent to the selecti ve case and has been further extended to a simple frequency-dependent case 
111 iNeuhauserl (jl999h . In addition, we study genealogical distances in models from evolutionary 
game theory, i.e. frequency-dependent selection. 



The paper is organized as follows: After stating the models (and their diffusion limits), we 
re-formulate the one-third rule and our results on genealogical distances. Then, we provide two 
proofs of the one-third rule. Finally our genealogical result is proved. 



2 Models 

We are studying the solution X = (Xt)t>o of the stochastic differential equation 

dX = ((1 - X)6 A - X9 B )dt + aX(l - X)((3 - -yX)dt + y/X(l — X)dW (2.1) 

for ci,9a,9b > 0, /3, 7 £ R. Here, a is called the scaled selection coefficient, and 9a and 9b the 
scaled mutation rates, and X gives the frequency path of type (or strategy) A in a population of 
constant size. For this diffusion, we define in the case 9a = 9b = 

Ti := inf{t > : X t = 1} 

and note that T\ < 00 if and only if type- A eventually fixes in the population. In our analysis, 
the version of (|2.1[) without mutation and without fluctuations, 

dx = ax(l -x){fi -~/x)dt (2.2) 

will also play a role. 

2.1 A finite Moran model 

We are turning to the frequency path of strategy (or type) A in a time-continuous Moran model 
of size N , which converges to X as N — > 00. We distinguish four cases, corresponding to the signs 
of (3 and 7. Setting = ,5/7 (which is an equilibrium point in (|2.2|l ). the four cases are: 

(i) (3 > and 7 > 0, which leads to a stable fixed point in x* for ()2.2|) (if x* G (0, 1)), 
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(ii) j3 < and 7 < 0, which leads to an unstable fixed point in x„ for (|2.2j) (if x* 6 (0, 1)), 

(iii) /3 > and 7 < 0, which leads x = 1 as the only stable fixed point in 

(iv) j3 < and 7 > 0, which leads x — as the only stable fixed point in 



In case (ii) and (iv), strategy A cannot invade a population consisting only of strategy B individuals 
in (| 2.2(1 and vice versa. 

The time-continuous Moran model is a Markov process with state space {A,B} N (indicating 
the types of N individuals) and the following transition rules: 

1. Every pair of individuals i and j rcsamplcs at rate 1. Upon a resampling event, the offspring 
of i (or j) replaces j (or i) with probability ^. 

2A. In cases (i) and (iii), i.e. when j3 > 0, the offspring of any A-individual replaces a randomly 
chosen individual at rate a/3. 

2B. In cases (ii) and (iv), i.e. when j3 < 0, the offspring of any i?-individual replaces a randomly 
chosen individual at rate — a/3 = a\(3\. 

3A. In cases (i) and (iv), i.e. when 7 > 0, every I?-individual picks a randomly chosen individual 
at rate cry. If it is an A-individual, an offspring of the S-individual replaces another, 
randomly chosen individual. 

3B. In cases (ii) and (iii), i.e. when 7 < 0, every A- individual picks a randomly chosen individual 
at rate — 0:7 = ce\j\. If it is another A-individual, an offspring of the first ^-individual 
replaces another, randomly chosen individual. 

4. Every individual is hit by a mutation event at rate § := 6 A + 0b- With probability 6 A /9, 
the individual turns into type A, and with probability 9b/0-, it turns to B, independent of 
the type of the parent. 

In the graphical construction of the Moran model, the transitions 1.-3. are given by arrows denoted 
T1-T3, respectively. The T3-arrows are in fact double-arrows since the t ype of one individ ual has 
to be checked (along the line which we will call the checking line; see also lNeuhauserl(|l999t) ) while 
the second arrow leads to reproduction (which will be called the imitating line). See Figures Q] 
and [2] for illustrations of the transitions 1., 2. and 3. By now, it is a classical result that the 
frequency path of the Moran model converges weakly (wi th respect to the t opolo gy of uniform 
convergence on compact sets) to solutions of (|2.1|) ; see e.g. lEthier and Kurtz ( 1993h . 



2.2 Connection with evolutionary games 

Consider a game with payoff matrix 
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There arc ./V players, each one adopting either strategy (or type) A or B. If the frequencies of 
type A and type B are Xa and Xb, respectively, the fitnesses are dependent on the payoff matrix 
and are given by 

f A := ± + s(aX A + bX B ) = \ + s(b + (a - b)X A ), 

!b := \ + s{cX A + dX B ) = \ + s{d +(c- d)X A ). 

Note that aX A + bXs is the average payoff of a type-A individual when playing against a random 
player from the population. 

Here, the offspring of every individual of type A replaces a randomly chosen individual at rate 
f A . Accordingly, a type-i? offspring replaces a randomly chosen individual at rate fs (given that 
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Figure 1: Three different kinds of events occur in the Moran model which has limit (|2.1|) . (1) For 
every (ordered) pair of individuals i and j, a Tl-arrow occurs at rate i. Here, the offspring of 
i replaces individual j (independent of the types of i and j). (2) T2-arrows are the first of two 
selective arrows. In cases (i) and (hi) (cases (ii) and (iv)), they occur at rate a/3 (at rate —a/3). 
Upon such an event, the offspring of individual i replaces j, given that i has type A (type B). 
(3) T3-(double-)arrows, occur in case (i) and (iv) (cases (ii) and (hi)) at rate cry (at rate —cry). 
Here, the imitating arrow is created from i to j, as well as the checking arrow from i to k. Given i 
has type B (type A) the checked individual k has type A (in both cases), individual j is replaced 
by an offspring of individual i. 



t 



Figure 2: The graphical representation of the Moran model (without mutation). Here, every pair 
of individuals is connected by a Tl-arrow at rate 1, every individual is the origin of a T2-arrow at 
rate a|/3|, and every individual is origin of a T3-doublc-arrow to one checking and one imitating 
line at rate a\^\. Types/strategies are inherited along the arrows according to the rules 1.-3. 
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s is small enough such that the rates are non- negative). The frequency path of type- A, follows in 
the infinite-population limit, N — > oo, 

dx = sx(l — x)(b + (a — b)x — d — (c— d)x)dt = sx(l — x)(/3 — jx)dt 

with 

fi:=b-d, 7 — 6-d + c-a. (2.4) 

Moreover, when N — > 00, s — > such that Ns — > a, and time is rescaled by a factor of N, 
the frequency path of type-^4 follows the SDE as given in ()2.1|) . Since the Moran model from 
Section 12.11 has the same diffusion limit, we use its graphical representation in order to obtain 
insights into the evolutionary game just described. 



3 Results 

Starting by re- formulating the one-third rule in Section [3. 1[ we give our result on the distribution 
of genealogical distances in the Moran model from Section 12.11 in Section 13.21 



3.1 The one-third rule 

We are now ready to formulate the one-third rule. For 9 a = Qb =0, we define 

p fix (e,a) := P(Ti < oo\X = e), 

which is the fixation probability of type A if it starts in frequency e and selection intensity is 
a. Note that pfi x (e,0) = e, i.e. the chance that strategy A fixes equals its starting frequency, if 
evolution is neutral. 



Theorem 1 (One-third rule. lNowak et all (|2004j )). Let X = (X t ) t > be the solution of (|2.1j) with 
®a = Ob = 0. Then, 

In particular, pfi x (e, a) > Pfi x {s, 0) for a, s small enough, if and only if 
i. e. the fitness of type A is above average at frequency x = -| . 

In Section [5l we will provid e two different proofs of this result. The first proof goes back 
to Ladret and Lessardl ( 2007 ). the second one uses the ancestral selection graph which we in- 



troduce in Section |U 
3.2 Genealogical distances 

In order to precisely formulate our results on genealogical distances, we are considering the Moran 
model from Section [2.11 including mutation, i.e. 9 a, 9b > 0. Assume that the Moran model 
(which evolves according to the rules 1.-4.) has run for an infinite amount of time. Then, every 
pair of individuals has a most recent common ancestor which can be read off from the graphical 
representation; see Figure [5J 

Here, we denote by P./v.a the distribution of the Moran model with population size N and 
selection coefficient a in equilibrium, and by a the corresponding expectation. Note that the 
Moran model is finite and hence, the equilibrium exists and is unique. Moreover, we denote by R12 
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the genealogical distance of two randomly sampled individuals from the equilibrium population. 
In the case a = 0, recall that the distance of two randomly sampled individuals is exponentially 
distributed, and hence, E7v. Q= o[e _A - Rl2 ] iV ~*' oc > 1/(1 + A). The following result generalizes this fact 
to small a, i.e. weak selection. 

Theorem 2 (Genealogical distances under weak frequency-dependent selection). 
Let Ejv,a ond R12 be as above. Then, with 9 = 9 a + 6b , 

JimE*, B [e- »] = — - a 7 — — - — - - - — - - - — - - - + 0(« ). 



In particular. 



lim E WQ [i? 12 ] = l + a 7 — - w 2+ ? =z; + 0{a 2 ) 



Remark 3.1. 1. Note that - up to first order in a - the change in the genealogical distance 
only depends on 7 but not on (3. This is not surprising since it has been shown that in the 
case of frequency-independent selec tion, = 1, 7 = 0, the change in the genealogical distance 
is of order a 2 ; see Theorem 4.26 in Krone and Neuhauserl ( 19971 ). However, this leads to an 



interesting effect: assume that /? > 7 > 0. Then, type A is selectively advantageous at any 
time during the evolution of the population. So, one might guess that type- A individuals have 
a higher chance to get offspring and genealogical distances are shorter than under neutrality. 
However, as the result shows, up to first order in a, genealogical distances are larger than 
under neutrality. The reason is that for 7 > 0, selection is strongest by frequent interactions 
between A and B individuals, and such interactions require a high heterozygosity, which in 
turn requires larger genealogical distance. 

2. In our proof, we implicit ly take advan t age o f the recently developed theory o f tree- valued 
stochastic processes from Greven et al. ( 2012 ) and Depperschmidt et al.l ( 2012 ). The idea is 



to describe the evolution of the genealogical distance of two randomly sampled points. If no 
events hit the two lines, the distance grows at constant speed. If a Tl-arrow falls in between 
the two sampled individuals, their distance is reset to 0. Finally, with high probability, the 
two sampled lines are hit by T2- or T3-arrows only if the arrows originate from one of the 
N — 2 other individuals. Once the evolution of the distance of two randomly sampled lines 
is given, we only have to find a(n approximate) fixed point in order to show Theorem[21 The 
details are given in Section [6] 



4 The ASG for linear frequency-dependent selection 



The a ncestral selection graph (ASG) was introduced bv lNeuhauser and Krone! ( 19971 ) and lKrone and Neuhauser 



(|1997l ) in or der to study genea logies in the case of frequency-independent selection. Later, it was 



extended in Neuhauser (1999) to a model of minority-advantage, a special form of frequency 



dependence in an infinite alleles setting. 

Here, we introduce the ASG in order to give a proof of Theorem Q] For its construction, 
consider again Figure [2j If a sample of individuals is drawn at the top of the figure, one can 
read off all events which finally determine their types. Because there are three different kinds 
of arrows, the history of the sample as well comes with three different events. First, Tl-arrows 
between lines in the sample lead to coalescence events, because common ancestors are found along 
such events. Second, T2-arrows mostly origin from lines outside the sample. Since they as well are 
determinants of the types in the sample, such events lead to branching events into a continuing 
and an incoming branch. Third, T3-arrows lead to splits of a line into the continuing, the checking 
and the imitating line. 

This informal description turns into the following stochastic process: Starting with n lines, the 
following transitions occur: 
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1. Every pair of lines coalesces at rate 1. 

2. Every single line splits in two (called the continuing and incoming line) at rate a|/3|. 

3. Every single line splits in three (called the continuing, checking and imitating line) at rate 
a|7|. 

Consider again the graphical representation of the Moran model from Figure O Here, starting 
with n lines at the top of the figure, it might be that the last event which hits one of the n lines is 
a T2- or T3-arrow, which originates from one of the n lines. However, since TV is assumed to be 
large, and the TV — > oo limit gives (|2.1|) . this case can be ignored in the limit. 

Once this graph is run for time t (i.e. from time t down to time 0), and the starting frequency 
of type A (at time 0) of the forward process is given, we can determine the configuration of the 
n lines. First, assign randomly each of the lines of the ASG at time with A according to the 
starting frequency. Then, go through the ASG from time up to time t. The types arc inherited 
along every Tl-arrow. Moreover, we have to distinguish the inheritance rules in the four cases, 
(i)-(iv). 

2A In cases (i) and (iii), i.e. when (3 > 0, the type of the incoming branch is inherited if it has 
type A. Otherwise the type of the continuing branch is inherited. 

2B In cases (ii) and (iv), i.e. when (3 < 0, the type of the incoming branch is inherited if it has 
type B. Otherwise the type of the continuing branch is inherited. 

3A In cases (i) and (iv), i.e. when 7 > 0, the type of the imitating branch is inherited if it has 
type B and the checking branch has type A. Otherwise the type of the continuing branch is 
inherited. 

3B In cases (ii) and (iii), i.e. when 7 < 0, the type of the imitating branch is inherited if it has 
type A and the checking branch has type A. Otherwise the type of the continuing branch is 
inherited. 

It is important to understand that these rules are reminiscent of the corresponding transitions in 
the Moran model. As an example, consider Figure EJD)- (Here, the ASG is given starting with 
a single line at the top, n = 1.) In case (i) and (iv), the type of the line at the top is A only if 
both lines at the bottom have type A. If the first line is B, the continuing branch of the T3-event 
is followed which carries type B. If the first line is A and the second line is B, rule 3A says that 
the type of the imitating branch is inherited, hence the line at the top has type B. In case (ii) 
and (iii), the line at the top is A if and only if the first line at the bottom is type A. In this case, 
the imitating line can only be used if it has type A as well, leading to type A at the top. If the 
first line carries B, the continuing line is used at the T3-event, leading to type B at the top. In 
Table [1] we assume that the frequency of type A at time is e -C 1, and we give all possibilities 
which lead to fixation of type A, which require at most one type- A individual in the past, and 
thcire respective probabilities up to order s. 



5 Two proofs of Theorem [T] 

5.1 A proof based on Roussetl ( 2003 ) and Ladret and Lessard ( 2007 ) 

Although the proof of Ladret and Lessard ( 20071 ). which is based on ideas from Rousset (|2003f ) 
is carried out in a time-discrete model, the same argument works for the SDE ()2. 1|) . Again, we 
write E e a [.] for expectations with Xq = e and fitness coefficient a. First, (|2.1j) is continuous in 
the parameter a, such that E £:Q [/(A t )] = E e ,o[/(X t )](l + 0{a)) for / g C h 2 (M). Moreover, by the 



5 TWO PROOFS OF THEOREM ?? 



S 



present 



(A) 



(B) 



(C) 



(D) 



(E) 



(F) 



past 

Figure 3: All different possible events/genealogies in the ancestral selection graph (ASG) connected 
to (|2.ip . Time in the ASG is running from top to bottom, i.e. from the present to the past. (A) 
In this case, when picking a single line in the present, its type only depends on the type of its 
ancestor, which is the single line at the bottom. (B) A T2-arrow leads to a split of a single line into 
a (straight) continuing line and an incoming line. In case (i) (case (h)), if the incoming line has 
type A (type B), the type at the top is type A (type B) as well. If the incoming line has type B 
(type A), the type at the top equals the type of the continuing line. Here, the type of the line in 
the present depends on the types of both lines at the bottom. (C) A T3-arrow leads to a split of 
a single line in the checking (c), imitating (i) and continuing line. Here, in case (i) (case (ii)), if 
the checking line is type A (in both cases), and the imitating branch is type B (type ^4), the type 
at the top is type B (type A) as well. In all other cases, the type at the top equals the type of the 
continuing line. (D), (E), (F) In addition to a T3-branching event, coalescence of two lines might 
happen, leading to two lines present in the ASG at the bottom of the time window. 
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Case (ii): in which cases does A fix? 
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Case (ii): P[A fixes Q = g, X = e] 
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Case (iii): in which cases does A fix? 
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Case (iii): P[^4 fixes Q = g,X n = e] 
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Case (iv): in which cases does A fix? 
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Case (iv): P[A fixes Q — g, Xq = e] 
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Table 1: All possible genealogies close to time in the ASG for small selection strength a and a 
small starting frequency Xq = £<1. In the second (fourth, sixth, eighth) line we indicate which 
of the lines at the bottom has to be type A such that type A fixes in case (i) (case (ii) , case (iii) , 
case (iv)). In the third (fifth, seventh, ninth) line we give the corresponding probabilities (with 
error terms of order £ 2 ). 
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duality of the Wright-Fisher diffusion, i.e. (|2.ip with a = 0, to Kingman's coalescent, 
E Et0 [X t (l-X t )} 
V efl [X 2 {l-X t )] 

as e — !• 0. Here, the first equation arises since 2X t (l — X t ) is the probability to obtain two different 
types when picking from the population. This event requires that the two sampled individuals 
do not share an ancestor between times and t. Moreover, both ancestors have to have different 
types. For the second equality, when sampling three individuals, two of which are type A, there 
are two possibilities. Either no coalescence of the three sampled lines occurred between times 
and t, or two of three lines coalesced and have a type A-ancestor. Combining these results, 

P £ , a (A fixes) = Ee.a^oo] = e + j —E £ , a {X t ]dt 

pOO 

= e + a E e , [Z t (l - X t ){fi - jX t )}(l + 0{a))dt 
Jo 

r°° i 

= e + a (p e -t(e + 0(e 2 ))- -^(1 -e- 2t )(e + 0(e 2 )))(l + 0(a))dt 
Jo * 

= e(l + a{p-^)) + <D(a 2 ,e 2 ), 
which finishes the first proof. 

5.2 A proof based on the ASG 

Let us prove Theorem[T]by using the ASG. Here, type A eventually fixes if and only if the common 
ancestor of all individuals carries type A. Therefore, we have to study the ASG which has run for 
an infinite amount of time (i.e. from time infinity back to time 0). Since a is assumed to be small, 
we can assume that the ASG has only a single line at time with high probability. However, with 
probability of order a, the ASG has split close to time but the resulting lines have not fully 
coalesced yet. Since a is small, we can assume that the last split event happened when the ASG 
only had a single line. 

In Figure |3l we list all possible ASGs near time which we have to consider. Start with case 

(B), which is a split in two lines. We abbreviate this genealogy by |~| . Noting that such split events 
occur at rate a\/3\, and coalescence of the resulting two lines happens at rate 1, the probability for 
this case is (by competing exponential clocks) 

p ( h ) = r+^p| + = + ( 5A ) 

case when the last split occurs into three lines, which do not coalesce up 
Since coalescence of any pair happens at rate 3, we obtain 

p (ttl ) = 3T^i +C,( " 2) = £ 1r 1+0(Q2> - <«> 

Finally, the last split can lead to three lines and two of them coalesce up to time 0. Since all three 



= e-*e(l-£) = e- t (e + 0(e 2 )), 

= e- 3t e 2 (l -e) + f ae" 3 ^-^ \e(l - e)ds 
Jo 3 

= 1 e-t(l-e-*)(s + 0(s 2 )) 



Similarly, consider the 
to time (case H~j ). 
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possible genealogies (denoted fjj"~| , |4^1 and pp ) have equal probability, we obtain 

1_o|t|_ 
3 1 + 7| 



P 



Last, we have for the remaining case j 



P 



!)= i - p (h)- p (rti)- p (^i)- p (rt 1 )- p (^) + ^ 2 ) (5.4) 

= l-a(\P\ + ^\ 1 \)+0(a 2 ). 

Using Table [T] for the probabilities of fixation conditioned on the genealogy, we can now collect all 
terms in the four cases. We obtain 



Case (i): Pfix (e, a) = e ■ P ( j ) + 2e • p( ) + e ■ p( ) + e ■ p( ftf 

+ e-P( |"p ) +0(a 2 ,e 2 ) 
Case (ii): p fix (e, a) = e ■ p( | ) +e-p( ffl ) + e ' p ( ^ ) + e-P( ft 



+ 2e-p( ft ) +0(a 2 ,e 2 ) 

Case (iii): p fix (e, a) = e • p( | ) +2e-p(^) + e .p( ft )+£-p( ft ) 

+ e-p( ft ) +2e-p( ft )+0(c*V) 

Case (iv): p fix (e, a) = e • P ( j ) + e • p( ft ) + e • p( ft ) + e • p( ft ) +G(a 2 ,e 2 ) 
Plugging in the probabilities from (|5.ip , (|5.2[) . (|5.3[) and (|5.4|) . we obtain in all cases 

p fix (e I a)=e(l + a( J 8- 7 i)) + 0(a 2 ,e 2 ) (5.5) 
which finishes the second proof of Theorem [TJ 

6 Proof of Theorem [2] 

The proof is based on the graphical construction of the Moran model; see Figure [2j We have to 
take into account all three different kinds of arrows. Let R12 = Ri2(t) be the random distance of 
two individuals taken at random from the Moran model at time t. In addition, Z3 = Z^{t) and 
Z A = Z±(t) are the types of two additionally sampled individuals. We claim that in case (i) 

^E^e"**"] = -XE N , a [e- XR ^} + 1 - E N , a [ e - XR ^} 

+ a| 7 | ■ E w , Q [l Z 3 =B l Zj=A (e- Afl23 - e~ XR ^)\ + 0(1/N). 
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Here, the first term describes the increase of the genealogical distance of individuals 1 and 2 if no 
events occur. Second, Tl-arrows between 1 and 2 lead to Rn = (hence e~ XRl2 = 1) at rate 1. 
Third, the origin of T2-arrows is with high probability (1 — 0(1/N)) neither individual 1 nor 2, but 
a third individual, with type Z%. However, this arrow only takes effect if Z% = A. Last, T3-arrows 
again originate from a third individual with type Z3 . It picks a fourth individual with type and 
if Z3 = B and Z^ = A, the arrow takes effect. All terms require that the two/three/four chosen 
individuals are distinct; hence the error term 0(1/N). 

Our goal is to approximate the right hand side of (|6.1|) . We note that the ASG implies that 
for all expectations on the right hand side we have that Ejv jQ: [.] = Ejy,o[-] + 0(a) for small a and 
large TV. Hence, we can use Ejv,o[-] as an approximation which is valid up to order a. 

Recall that the mutation rate from A to B is 6* B /2 and from B to A is a/2, as well as 
:= A + Ob- We note that for N — > 00, we can use Kingman's coalescent, since it is known to 
give genealogies in the large population limit of the neutral Moran model. We write E[.] for the 
expectation when using Kingman's coalescent and finally argue that Ejv,o[-] = + 0(1/N). We 
obtain 

_, ri 1 , Ob/2 a a /2 b a b 1 

ih[Lz 1 =B±Z 2 =A\ — - t. ■ -jT 



E[l Zl=A e- AKl2 ] = E[l Z3=A e 



1+0 1+0 1 

A 1 



1 + A' 



Here, the first equality holds since Z\ = B and Z 2 = A if and only if the first mutation event 
occurs before the two sampled lines coalesce. This mutation event must either be B — > A in 
individual 1 or B — > A in individual 2. Using similar arguments, we obtain 

E[l M l M e-^>] = f(l + &)e-^ ■ ( l±Mg+Mg . 

Jo \ 1 + 1) 

+ ^E[l Za=B e-*<«+K»>] + e -^E[l Zl=A e-^ +R ^])dt 

1 + u 1 + o I 



B 



1 



{i + e + \){i + \y 

poo 

E[U a=B lz,=Ae- XH »] = / (3 + 0>-(W 

Jo 



-L^E[l Zl=A l Z2=B e-~^ t+R ^} + -LE[l Z2=A l Z3=B e- At ] 

+ ^Jl n i Zi=B e-^^] + e J^_E[l Z3=A e-*+^])dt 
A B 1(1 11 



3 + 6» + AV(l + 6» + A)(l + A) l + 6» 1 + A 
/2 + + X 1 1 



\i + + X{l + + \){l + X) {i + + \){0 + 1) 
Wz 2 =Alz s = B e~ XRl2 } = E[l Z2=B l Z3=A e- Ai H 
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/•OO 

Jo 



+ ^E[l M e-*<"**>] + ^Ell^e-^)])* 
Qa^b 1 / 1 2 + + A 4 



6 + <9 + AVl + <9 3 + (9 + A(l + (9 + A)(l + A) 

4 1 



(3 + 6> + A)(l +9) 1 + A 

gAgg / 7 + (9 + A 1 

9 \6 + e + \(3 + e + X)(i + 9) 

2 + + X5 + 8 + X 1 

+ 



3 + e + X6 + e + x(i + e + a)(i + a) 

i 



(6 + + A)(3 + + A)(l + A) 
Plugging the last two terms in (|6.ip we find that in equilibrium 

E N , a [^ XRl2 ] = T^-T + ^rAv N v[l Z2=A l z , =B e- XR ^ - l Z3 = A l Zl = B e- XR ^-] + 0{a 2 ) 

1 -r A 1 + A 

1 a|7| (9 A 6> B 1 ^2 + + A 1 

)0 + A) 

)+0(a 2 ) 



1 + A 1 + A 6 6 + 6» + AV3 + 6» + A(l + 6i + A)(l + A) 

1 1 



(3 + + A)(l + 0) (3 + + A)(l + A). 
a| 7 | A B 1 2 + + A/ 1 



1 + A 1 + A 6 6 + 6 + X'i + 6 + X\{l + 6 + X){l + X) 

0(a 



1 



(1 + 0)(1 + A) 

1 + A 1/1 9 (6 + + A)(3 + + A)(l + + A)(l + 0)(l + A) 2 1 ; ' 
In particular, using a derivative according to A at A = 0, 

In case (ii), (|6.1I) changes to 

^ Nta [e- XR ^ = -XE N , a [e- XR ^] + 1 - E N , a [e- XR ^} + a\f3\V N , a [l Z3 = B ( e - XR ^ - e"^)] 

+ «| 7 |E^ Q [l Z3=A l Zl=A (e- AH23 - e - A/? -)] + 0(l/iV). 
As above, the term behind cx\j3\ vanishes. Moreover, 

^N,a[^Z 2 =A^Z 3 =Ae^ XRl2 } = E Nt a[l Z2=A e~ XRl2 ] - Ejv,a [i-Z 2 =A lz 3 =-B e ~ 12 ] J 

E^ Q [lz 3 =Alz 4 =Ae- Ai?12 ] - Ejv, Q [l Z4=A e- Afll2 ] - Ejv, Q [l Z3= .Blz 1=A e- Ai H 
Therefore, in equilibrium, 

E N , a [ e - A%2 ] = — ^- + -^lLE^o[lz 2 =Alz3=Ae- Aiil2 - lz 3 =Alz 4 =Ae- A/il2 ] + 0(a 2 , 1/JV) 

1 T" A 1 T" A 

rl E^o[lz 2 =Alz 3 =Be" Ai?13 - l Z3= Alz 4 = B e- Ai? -] + 0(a 2 ,l/7V). 



1+A 1+A 
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Using the result from case (i), we are done with case (ii). Cases (iii) and (iv) are similar, because 
the term behind a\(3\ vanishes. This finishes the proof. 
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